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Abstract. We prove L p estimates (Theorem 1.3) for the Bi-Carleson operator defined 
below. The methods used are essentially based on the treatment of the Walsh analogue 
of the operator in the prequel [11] of this paper, but with additional technicalities due to 
the fact that in the Fourier model one cannot obtain perfect localization in both space 
and frequency. 



I. INTRODUCTION 

The maximal Carleson operator is the sub-linear operator defined by 

C(/)(;r):=sup / f(Oe 2mx ^ , (1) 

N J£<N 

where / is a Schwartz function on the real line R and the Fourier transform is defined by 

HO := I f(x)e- 2 ^dx. 

m 

The following statement of Carleson and Hunt [1], [3] is a classical theorem in Fourier 
analysis: 

Theorem 1.1. The operator C maps L p — ► L p , for every 1 < p < oo. 

This result, in the particular weak type (2, 2) special case, was the main ingredient in 
the proof of Carleson's famous theorem which states that the Fourier series of a function 
in L 2 (R/Z) converges pointwise almost everywhere. 

The bilinear Hilbert transform is an operator which can be essentially written as 

B(fi,h)(x):= [ A(6)/ 2 (6)e 27ri ^ 1+6) ^i^2, (2) 

where f\ , fi are test functions on R. From the work of Lacey and Thiele [5] we have the 
following IP estimates on B: 

Theorem 1.2. B maps 

L P1 X L P2 _> L P Z when 

ever 1 < pi,p2 < oo, l/p± + l/pz = l/p / 3 , 

and 2/3 < p' 3 < oo. 

The purpose of the present paper is to study the L p boundedness properties of the 
Bi-Carleson operator defined by 



T(f 1 ,f 2 )(x) := sup 

N 

Our main theorem is the following: 



A(£i)/ 2 (6)e 2 ™ (?1+6) ^i^ 

?i<6<A r 



(3) 
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Theorem 1.3. The Bi-Carleson operator T defined above maps L Pl x L P2 — > L p » as long 
as 1 < pi,p2 < oo ; 1/pi + I/P2 = 1/^3? an d 2/3 < p 3 < oo. 

This Bi-Carleson operator can be thought of as being a hybrid of the Carleson operator 
and the bilinear Hilbert transform. In fact, our main theorem above implies Carleson- 
Hunt's theorem and Lacey-Thiele's theorem as special cases (in Section 11 we also give 
an expository proof of Carleson-Hunt theorem). 

The interesting and beautiful fact about this operator (and its "relative" bi-est operator 
studied in [9], [10]) is that, unlike the Carleson operator or the bilinear Hilbert transform, 
it has a very special biparameter structure not seen among the previously studied operators 
in harmonic analysis. As the careful reader will notice, understanding this structure is 
the main challenge of the paper. 

In [11] Theorem 1.3 was proved for a Walsh- Fourier analogue Tu;aZs/i,p,Q °f T. From the 
point of view of time-frequency analysis the two operators are closely related, however the 
Walsh model is easier to analyze technically because it is possible to localize perfectly in 
both space and frequency simultaneously. In the Fourier case one has to deal with several 
"Schwartz tails" which introduce additional difficulties. For instance, in the Walsh model 
an inner product (</>p,</>q) of wave packets vanishes unless the spatial intervals Ip and Iq 
are nested; however in the Fourier model one needs to consider the case when I P and Iq 
are separated (although the estimates improve rapidly with the relative separation of Ip 
and I Q ). 

The study of T clearly reduces to the study of its linearized version defined by 

T(f u f 2 )(x) := [ A(6)/ 2 (6)e 27r ^ 1+6) ^i^2, (4) 

Jti<€2<N(x) 

where "x — » N(x)" is an arbitrary function. 

We recall now a theorem from [7] which will take care of an error term later on. 

Let m be a bounded function on R 2 , satisfying the classical Marcinkiewicz-Mihlin- 
Hormander condition 

\dfm(£)\ < (5) 
for sufficiently many multiindices a and define the multiplier operator 

r m (/i,/ 2 )(x) := / 2 m(6,6)/i(6)/ 2 (6)e 27ri ^ 1+6) ^i^2. 
JR 

Similarly, we consider the Carleson type operator associated to the symbol m and 
defined by 

C m (/i,/ 2 )(x) := sup |T Tj?ro (/i,/ 2 )(a:)| , (6) 
A?eR 2 

where r^m(£) := m(£ — N). 

In [7] the following theorem has been proven: 

Theorem 1.4. The operator T m maps L' Pl x L' P2 — > L p '^ as long as 1 < pi,p 2 < °°; 
l/pi + l/p2 = I/P3; an d < p' 3 < oo. 
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As in [11] it shall be convenient to split our linearized operator T into two pieces plus an 
"error term", which is an operator of the form (6). Specifically, fixiVel and construct 
the symbols m' N ,m'^,m% such that 

supp(m' w ) C {6 < 6; < N}, 

m' N is smooth away from the line £i = £ 2 and equals 1 on ^ = £ 2 , 

supp(m^) C <&<N}, 
m" N is smooth away from the line £2 = N and equals 1 on £ 2 = N, while 

^jv : = Xti<t2<N ~ m' N - m" N 

is smooth away from the vertex (TV, N) of the cone {£1 < £2 < N} and moreover, 
T -(N,N) m N satisfies (5) (with the corresponding constants independent of N). 

The multipliers m' N , m" N and m% will be carefully constructed in the forthcoming 
sections. Consequently, our operator T in (4) can be written as 



Since T'" can be estimated by Theorem 1.4, it is enough to prove that both T' and T" 
satisfy the conclusion of the main Theorem (1.3). 

While the current article is mostly self contained, a knowledge of the prequel [11] 
should help. However, we believe that the overall presentation of the ideas related to this 
problem, has been improved in the meantime. In particular, we managed to avoid the use 
of the complicated mixed sizes "size(/, g)" and energies "energy (/, g)" which appeared in 
the second half of [11]. 

The first author was partially supported by NSF. The second author is a Clay Prize 
Fellow and is supported by a grant from Packard Foundations. The third author was 
partially supported by a Sloan Fellowship and by NSF. 



2. Notation 

In this section we set out some general notation used throughout the paper. 

We use A < B to denote the statement that A < CB for some large constant C, and 
A ^ B to denote the statement that A < C~ l B for some large constant C. We will 
sometime write A ~ B and this means that A < B < A. Given any interval (or cube) J, 
we let I J I denote the measure of /, and cl denotes the interval (or cube) with the same 
center as / but c times the side-length. 

Given a spatial interval J, we shall define the approximate cutoff function xi by 

*/(*) == (i + (^|^) 2 r 1/2 , 

where xi is the center of I. 

A collection {u} of intervals is said to be lacunary around the frequency £ if we have 
dist(£,u;) ~ for all uj in the collection. 
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3. INTERPOLATION 

In this section we review the interpolation theory from [8] which allows us to reduce 
multi-linear L p estimates such as those in Theorem 1.3 to certain "restricted type" esti- 
mates. 

To prove the L p estimates on T it is convenient to use duality and introduce the trilinear 
form A associated to T via the formula 

A(/i,/ 2 ,/ 3 ) := / T{h,f 2 ){x)h{x)dx. 
</R 

Similarly, define A' and A" associated with T' and T" respectively. The statement that T 
is bounded from L Pl x L P2 to L p ^ is then equivalent to A being bounded on L Pl x L P2 x L Pa 
if 1 < p' 3 < oo. For < 1 this simple duality relationship breaks down, however the 
interpolation arguments in [8] will allow us to reduce our desired estimate to certain 
"restricted type" estimates on A. As in [11] we find more convenient to work with the 
quantities = 1/pi, i — 1, 2, 3, where pi stands for the exponent of L Pi . 

Definition 3.1. A tuple a = (a±, a 2 , a 3 ) is called admissible, if 

— OO < Ojj < 1 

for all 1 < i < 3, 

3 

^Oii = 1 

i=i 

and there is at most one index j such that ctj < 0. We call an index i good if ai > 0, and 
we call it bad if ai < 0. A good tuple is an admissible tuple without bad index, a bad tuple 
is an admissible tuple with a bad index. 

Definition 3.2. Let E, E' be sets of finite measure. We say that E' is a major subset of 
E if E' C E and \E'\ > \\E\. 

Definition 3.3. If E is a set of finite measure, we denote by X{E) the space of all 
measurable complex-valued functions f supported on E and such that \\f\\oo < 1- 

Definition 3.4. If a = (ai,a 2 ,a 3 ) is an admissible bad tuple with bad index j , we say 
that a 3 -linear form A is of restricted type a if for every sequence E ± , E 2 , E 3 of subsets of 
R with finite measure, there exists a major subset E[- of Ej such that 

|A(/i,/ 2 ,/s)| <\E\ a 

for all functions /j G X(El), i — 1, 2, 3, where we adopt the convention E[ = E; L for good 
indices i, and \E\ a is a shorthand for 

\E\ a = I^HEaM^sp. 

If a = («i, a 2 , 0J3) is an admissible good tuple, we say that a 3-linear form A is of restricted 
type a if there exists j such that for every sequence Ei, E 2 , E 3 of subsets of R with finite 
measure, there exists a major subset Ej of Ej such that 

\Hfi,f2,fs)\<\E\ a 
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for all functions fi G X(E' i ), i = 1,2,3, where this time we adopt the convention E\ = Ei 
for the indices i ^ j . 

Let us consider now the 2-dimensional affine hyperspace 

S := {(ai, a 2 , a 3 ) E R 3 | «i + a 2 + a 3 = 1}. 
The points Ai, A 6 belong to S and have the following coordinates: 
A i: |) A 2 :(±,l,-§) A 3 :(l,|,-|) 

A 4 :(l,-|,|) A b :(\,-\,l) Aq : 

The points Mi 2 , M 34 , M 56 are midpoints of their corresponding segments and have the 
coordinates M 12 : (0,1,0), M 34 : (1,0,0), M 56 : (0,0,1). Also, the point A has the 
coordinates (1,1,-1). 

The following "restricted type" results will be proved directly. 

Theorem 3.5. For every vertex Ai, i = 1, . . . ,6 there exist admissible tuples a arbitrarily 
close to Ai such that the form A' is of restricted type a. 

Theorem 3.6. For the vertices M^q, Mi 2 , M34, A there exist admissible tuples a arbitrarily 
close to them, such that the form A" is of restricted type a. 

By interpolation of the restricted type estimates in the previous Theorem 3.5, we first 
obtain (cf. [8]) 

Corollary 3.7. Let a be an admissible tuple inside the hexagon \A\, ...,A^. Then A' is 
of restricted type a . 

Similarly, we also have 
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Corollary 3.8. Let a be an arbitrary tuple inside the polygon [M 5 qM u AM 12 ] . Then A" 
is of restricted type a . 

Intersecting these two corollaries we obtain the analogous result for A and the pentagon 
[M 56 M U A 3 A 2 M 12 \. 

Since one observes that Pi,p 2 ,p' 3 satisfy the hypothesis of Theorem 1.3 if and only if 
(l/pi,l/p 2 ,l/ps) G [M 5 qM 3 4A 3 A 2 Mi 2 }, it only remains to convert these restricted type 
estimates into strong type estimates. To do this, one just has to apply (exactly as in [11]) 
the multilinear Marcinkiewicz interpolation Theorem [4] in the case of good tuples and 
the interpolation Lemma 3.11 in [11] in the case of bad tuples. 

This ends the proof of Theorem 1.3. Hence, it remains to prove Theorem 3.5 and 
Theorem 3.6. 

4. Grids and tiles 

We now start to prove Theorem 3.5 and Theorem 3.6. We look at the symbol X£i<&<n 
as being the product between x$i<& an d X&<n- To construct the multipliers 
and m"^ we plan to carve these two symbols carefully into smaller pieces. To take care of 
the first symbol X£i<£ 2 > we ^ rs ^ nee d to recall some definitions from [10]. 

Definition 4.1. For n > 1 and a G {0, |, |} n . We define the shifted n-dyadic mesh 
D = D™ to be the collection of cubes of the form 

D n a := {<2?(k + (0, 1)" + (-l)V)|j G Z, k G Z"}. 

We define a shifted dyadic cube to be any member of a shifted n-dyadic mesh. 

Observe that for every cube Q, there exists a shifted dyadic cube Q' such that Q C ^Q' 
and \Q'\ ~ \Q\. 

Definition 4.2. A subset D' of a shifted 3-dyadic grid D is called sparse, if for any two 
cubes Q,Q' in D with Q ^ Q' we have \Q\ < \Q'\ implies |10 9 <5| < \Q'\ and \Q\ = \Q'\ 
implies ' 1 ()''(/ = 0. 

Observe that any subset of a shifted 3-dyadic grid, can be split into 0(1) sparse subsets. 

Definition 4.3. Let a G {0, |, |} 3 be a shift. A collection Q C of cubes is said to 
have rank 1 if one has the following properties for all Q,Q' G Q; 

• If Q 7^ Q' ) then Qi ^ Q\ for all i = 1, 2, 3 (in particular, the Q are disjoint). 

• If3Q f j C 3Qj for some j = 1, 2, 3, then 10 7 Q4 C 10 7 Qi for alll<i<3. 

• // we further assume that \Q'\ < |10 9 <5|, then we have 3Q[ fl 3Qi = for all i ^ j . 

Also, if Q is a cube in R 3 , denote by — Q the reflected cube about the origin; similarly 
for intervals in R. 

Definition 4.4. Let a = (<7i, a 2 , a 3 ) G {0, |, |} 3 7 and let 1 < % < 3. An i-tile with shift cr^ 
is a rectangle P = I P x oj p with area 1 and with Ip G oj p G D\.. A tri-tile with shift 

a is a 3-tuple P = (Pi, P 2 , P3) such that each Pi is an i-tile with shift o~i, and the I Pi = Ip 
are independent of i. The frequency cube Qp of a tri-tile is defined to be Qp = Yli=i ^Pi- 

We shall sometimes refer to i-tiles with shift a just as i-tiles, or even as tiles, if the 
parameters a, i are unimportant. 
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Definition 4.5. A set P of tri-tiles is called sparse, if all tri-tiles in P have the same 
shift and the set {Q p : P G P} is sparse. 

Again, any set of tri-tiles can be split into 0(1) sparse subsets. 

Definition 4.6. Let P and P' be tiles. We define a "relaxed ordering" and write P' < r P 
if Ip> £ Ip and 3u P C 3u P ,, and P' < r P if P' < r P or P' = P. We write P' < r P if 
1 P , C I P and Wup C Wu pl . We write P' <' r P if P' < r P and P' ^ r P. 

This "relaxed ordering" < r is in the spirit of that in Fefferman [2] or Lacey and Thiele 
[5], but slightly different as P' and P do not quite have to intersect. This is more 
convenient for technical purposes. 

Definition 4.7. A collection P of tri-tiles is said to have rank 1 if one has the following 
— * — * — * 

properties for all P, P GP: 

• IfP^ P', then Pj ^ Pj for all j = 1, 2, 3. 

• If Pj < r Pj for some j = 1, 2, 3, then P[ < r P, t for all I <i < 3. 

• If we further assume that \Ip,\ < 10 9 |/^| 7 then we have P( <' r Pi for all % ^ j. 

Definition 4.8. Let P be a tile. A wave packet on P is a function (pp which has Fourier 
support in j^oup and obeys the estimates 

\ct> P {x)\<\Ip\- l/2 Ux) M (7) 
for all M > 0, with the implicit constant depending on M . 

Heuristically, 0p is L 2 -normalized and is supported in P. 

To take care of the second symbol x&<n we need two more definitions. 

Definition 4.9. Let u e [—1,1] and v G R. We define the dyadic grid D = D u>v 
associated to the parameters u, v to be the collection of all intervals of the form 

TI + v, 

where I G Dq is an arbitrary dyadic interval on the real line. 

Definition 4.10. Let u G [—1, 1] and v G H, as before. A bi-tile associated to the 
parameters u,v is a rectangle P = Ip x ujp of area 2 such that ujp G D u v and Ip G D_ u . 
To any bi-tile P we define its (sub)-tiles P^Pi as being given by P\ = Ip 1 x uj Pl , P 2 = 
Ip 2 x ijjp 2 , where Ip 1 = Ip 2 := Ip, Up 1 := u l P , u)p 2 := oj p , while uj l p and uj p are the left and 
right halves of uj p . 

5. The symbol m' N 

Fix N. In this section we construct the symbol m' N . As in [10], by a standard partition 
of unity we can write 

x&<&(£l,6,6) = Yl 6, 6) 

ae{o,|,f} 3 QeQ CT 

whenever ^1 + ^2 + ^3 — 0, where Q CT C D\ is a collection of cubes which intersect the 
plane {£1 + £2 + £3 = 0} and which satisfy the Whitney property 

10 3 diam(Q) < dist(Q, {£1 = 6 + £ 2 + £3 = 0}) < 10 5 diam(Q) 
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for all Q G Qcr, and for each cube Q G Q CT , 0q )CT is a bump function adapted to j$Q- Note 
that by refining Q CT by a finite factor if necessary one can make Q CT have rank 1 (this 
refining of Q CT corresponds to decomposing X£i<6 i n ^° 0(1) pieces). 

Also as in [10], by splitting 0q i(7 as a Fourier series in the ^ we can then write 

(6,6,£ 3 ) = E C < E E 

ieZ3 CT6 { ,I,|}3Q6Q CT 

where q is a rapidly decreasing sequence and 7]Q jiCti ij is a bump function adapted to j^Qj 
uniformly in /. 

Let now be a Schwartz function so that <fi is supported on [—1/8,1/8] and which 
equals 1 on [—1/16, 1/16]. If uj is a dyadic interval, let W be the function defined via the 
Fourier transform by 

£ := T <u) D$fi (8) 

where in general 

T v f(x) :=f(x-y) 
D?f(x) := /(A" 1 *) 

are the translation and dilation operators respectively. As in [6] (see also [2]), one can 
write X(<n as 

Xt<N = c ■ lim / V xA^\N + V ))T^D^(0d V dk (9) 

■= / E x w -(2- fc (^v + »7))r_^£i(0^, 

where i? n := [0,1] x [—n,n 1 C R 2 and uj l ,uj r are the left and right halvess of uj. In 
particular, this implies that on the line £ + £' = in R 2 one has 



xtMZ,Z')= -f E x^(2- fc (7v + 7 7 ))r_ r?J D 2 ^0 (j( (O^^^-^(O^ 



where this time is a Schwartz function, such that is supported on [—1/4,1/4] and 
equals 1 on [—1/8, 1/8]. As a consequence, the expression 

£iv(£i,6,&):=5> E /E E 

CT e{0,i,§}3 J QGQct wGDfc i?) ;— QsCcj' (10) 

•X^(2iV)£i(fi + &)4>-A&)dr]dk 
on the hyperplane £1 + £2 + ^3 = is equal to XSi<6XIi±|2<jv when |£i — £ 2 | < l^y^ ~ ^1 
(under the latter constraint the condition — Q3 C a/ is automatic for nonzero summands). 
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Then, we simply define the symbol m' N by setting m' N := E^(C,i,C, 2 , — £1 — £2) (hi other 
words, as we said before, the intuitive description of m' N is a smooth restriction of x$i<&<n 
to the region where £1 and £ 2 are closer to each other than they are to N). 

6. Discretization of V 
Recall that the bilinear operator T' has been defined by the formula 

T'(h,f 2 )(x) := T m , mx) (f 1 ,f 2 ){x). 

The aim of the present section is to reduce Theorem 3.5 to the following discretized version 
of it. 

Theorem 6.1. Let a E {0, |, |} 3 be a shift and let u E [0, 1], v E R. Let P be a finite 
collection of bi-tiles associated to the parameters u,v and Q be a finite collection of tri- 
tiles of rank 1 with shift a. For each % — 1, 2 and P E P, let <pp i = (j)p u i and <pp i = 4>p u i 



be wave packets on Pj. Similarly for each i = 1, 2, 3 and Q E Q let §q % = 4>q. i be a wave 
packet on Qi. Define the form - by 

Ap q(/l,/2,/3) := ^2(B Pl §(fl, f2),<f>Pi)(<t>PiX{x:N(x)eup 2 }, f3) 



PeP 

where 

- - I (31 

Then for every vertex Aj j = 1,...,6 7 there exist admissible tuples a arbitrarily close to 
Aj such that the form A' p - is of restricted type a, uniformly in the parameters a , u, v 

P, Q 7 0p i; <f)p i 4>q.. Furthermore, in the case that a has a bad index j , the restricted 
type is uniform in the sense that the major subset E'j can be chosen independently of the 
parameters just mentioned. 

In the rest of this section we show how Theorem 3.5 can be deduced from Theorem 6.1. 
To see this, we just have to calculate the form A' carefully. We fix and first look at the 
trilinear form associated to the operator T m ^. It is given by the formula 



, T m , N (f 1 J 2 )( X )f 3 (x)dx= / 5(6+6+e3)^(ei,6,e3)/l(6)/2(6)/3fe)^l«3, 

1R J 

where S is the Dirac delta function. Clearly, E N is an average of some simpler expressions 
depending on the parameters /, a, rj, k. We fix all of them and look at the corresponding 
form given now by 

J Qm-QzQ^ (11) 

•/l(£l)/2(6)/3(6Rl^&- 
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Figure 2. The tiles of A' 
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To calculate further (11), let us first consider the simpler object 

UW := J 5(ei+6 + e3)% 1 (ei)^ 2 (6)%3(6)/l(ei)/2(6)/3(6)^1^2^3 

for some cube Q G Q. By Plancherel this is equal to 

J (/1 * VQi)(x)(f2 * fj Qa )(x)(f 3 * fjQ 3 )(x) dx 

= l{Qf 12 J (fi,(pQ 1 ,x)(f2,(pQ2,x)(f3,(pQ s ,x) dx 

where 

0Q„*(y) := i(Qr 1/2 v Qj (x-y) 

and /(Q) is the side-length of Q. We can rewrite this as 

f E i / pr 1/2 (/i ? ^ 1 ,u)(/ 2 ,0p 2 ,t, 2 )(/3,0P3, t ,3)^ 

JO - 

where P ranges over all tri-tiles with frequency cube Q and spatial interval Ip in 
4>p h t,j is the function 

4>Pj,t,j '■= <t>Qj,xp+\I p \t 

and xp is the center of I p. Note that 0p-,tj is a wave packet on Pj uniformly in t. 
Similarly, we consider 



(f,b u ) := J 5(e + Ox^(2iV)0^(O0^(O«' 
for some fixed uj as above. Again, by Plancherel, this is equal to 
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'0 



E (/,0P 1 ,t,i)(/3X.p 2 (2iV),0p iiM )rft 

P;lop=ui 



where P ranges over all bi-tiles with frequency interval u, Pi := Ip 1 x (— uj Pi ) and 

(f>p u t,i(y) '■= Itp^^^ixp + \Ip\t - y), 



<h u t,i(y) '■= \ip\ 1/2 $-tj( x p + \ip\t - y) 



where xp is the center of the spatial interval Ip. As a consequence, our expression in (11) 
can be written as 



o Jo 



E a Q (r)b w (-T) dr 
■,-QsQoj' J 

J2( B PuAfij2)Ap 1 ^)(f3X^m^Pa,i) dtdtf 



Q,u,;-Q 3 Cu> 1 

1 r l 



PeP 



where 



Bp 1 ,t'{f2,h) '■— E TTjT/2^ 1 ' ^Qi,*',l)(/2) </>Q2,t',2)</>Q 3 ,t',3- 

Note that the collection Q of tri-tiles Q has rank 1. Observe that we can get rid of 
the complex conjugation sign in the definition of B Pltt / by redefining Q 3 to be —Q3 and 
redefining 4>Q 3 ,t',z accordingly; this also replaces the condition —ujq.^ C uj Pi by the condition 
mq 3 Q but it does not change the rank one property of the collection Q. 
This means that the operator corresponding to the form (11) becomes now 



1 /•! 



(B Put/ (f 1 J 2 ),<l )Pltt , 1 )^ utjlXuP2 (2N)dtd^ 

jo 

/ ( J Bp lI^; (/ 1 ,/ 2 ),0p lItIl )0p lItIl x. P2 (2iV)rf^dt , . 
'0 Jo 

In particular, our operator T' is in fact an average of simpler operators U of the form 

U{hJ 2 ){x) = f /" 1 (S Plit ,(/ 1 ,/ 2 ),0p 1)til )0p 1)til (x)xa, P2 (2iV(x))^ / . 
Jo Jo 

The claim then follows by integrating the conclusion of Theorem 6.1 over t, t', rj, k, 
summing over a, I and using the uniformity assumptions of that Theorem. (The finiteness 
condition on P and Q can be removed by the usual limiting arguments.) 
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7. The symbol m" N 

Fix N. In this section we construct the symbol m!' N . First, by using (9) we know that 

Xti<N ■ X&<n = 



ff 



E xA^\N+r 1 ))T_ v D^ l {i l ) Xjr {2- k \N+i^ 

w,u'eDo,o 



E E xAN)M^)xMN)^2)dvd V 'dkdk'. 
Then, we define m" N to be given by the following formula 

m N ■= ff E xAN)^i)xM^)M^)dvdv'dkdk'. 

bj' r r\bJ r ^;\J r \<\u} r \ 

We observe that the above expression is equal to XZi<&<n on the cone |£ 2 — N\ <C |£i — N\ 
(under the latter constraint, the condition J r fl uo r ^ 0; \oo' r \ < \uj r \ is automatic for 
nonzero summands). In the end we set m% := Xti<&<N — m N ~~ m N an d remark that all 
the requirements described in Section 1 are satisfied. 

8. Discretization of T" 
Recall that the bilinear operator T" has been defined by the formula 

T"(h,f 2 )(x) :=T ml ,JhJ 2 )(x). 

The aim of the present section is to reduce Theorem 3.6 to the following discretized version 
of it. 

Theorem 8.1. Let u,u' G [0,1], v,v' G R. Let P, Q be finite collections of bi-tiles 
associated to the parameters u,v and u',v' respectively. For each i — 1,2 and P G P, let 
(ft Pi — (pPi,i an d (ft Pi = <f>p it i be wave packets on Pi. Similarly for each i — 1, 2 and Q G Q 

let 4>q. = 4>Q u i and (fig. = 4>Q ui be wave packets on Qi. Define the form A p q by 

^P,q(/i)/2)/3) := ^ (fl , </>Pi ) (0Pi X{x:N(x)£u> Po } C^,Q (/ 2 ) , j-j) 

PeP 

where 

Cp 2 ,Q(f 2 ) := E (h><l > Qi)$QiX{x:N(x)eu Q2 }- 

u)Q 2 C\u)p 2 jt%\\wQ 2 \<\wp 2 \ 

Then for the vertices M 56 , M 12; A 2 , there exist admissible tuples a arbitrarily close to 
them such that the form Ap Q is of restricted type a, uniformly in the parameters a, u,v 

P, Q ; (pp^ <f)p i , 4>q., <pQ. Furthermore, in the case that a has a bad index j , the restricted 
type is uniform in the sense that the major subset Ej can be chosen independently of the 
parameters just mentioned. 
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In the rest of this section we show how Theorem 3.6 can be deduced from Theorem 
8.1. To see this, as before, we just have to calculate the form A" carefully. We fix N and 
first look at the trilinear form associated to the operator T m » . Clearly, wl' N is an average 
of some simpler symbols depending on the parameters 77, 77', k, k'. We fix all of these 
parameters and look at the corresponding operator given by 

/ E x^(^)£Kei)x.v(A^)^(6)/i(6)/ 2 (6)e 2 ™« 1+6) ^i^ 2 

w'TiwV0;l^' r l<l^ r l ( 12 ) 

J r f\ui r ^%;\J r \<\w r \ 

which can be discretized as before into 
1 

E (/l,0Pi,t,l>0Pi,t,lX^P 2 (^)(/2, (f)Q ut >,i)4>Q 1 ,t>,iXu, Q2 (N)dtdt l . 

u)q 2 C\u)p 2 ^%\\wq 2 \<\u)p 2 \ 

The claim then follows using a similar argument. 

9. TREES 

The standard approach to prove the desired estimates for the forms A' and A" is to 
organize our fixed collections of tri-tiles and bi-tiles into trees as in [2]. We may assume 
and shall do so for the rest of this article that our collections of tri-tiles are sparse. 

Firstly, we define trees in the context of of tri-tiles, by using the "relaxed ordering" < r 
considered before. 
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Definition 9.1. Let P be a collection of tri-tiles. For any 1 < j < 3 and a tri-tile Pt G P, 
define a j-tree with top Pt to be a collection of tri-tiles TCP such that 

Pj < r Pt,j for all P G T, 

— * 

where Ptj is the j component of P T . We write It and u>tj for Ip T and uj Pt . respectively. 
We say that T is a tree if it is a j-tree for some 1 < j < 3. 

— * 

Note that T does not necessarily have to contain its top P T . 

Definition 9.2. Let 1 < i < 3. Two trees T , T' are said to be strongly i-disjoint if 

• Pi^p; for all P eT, P' e T. 

• Whenever P G T , P' G T' are such that 2u>p i n2up> ^ 0, then one has Ip,nlT = 0, 
and similarly with T and T' reversed. 

Note that if T and T' are strongly i-disjoint, then I P x 2up i fl Ip> x 2ujp> i = for all 

P eT, P' e T. 

— * — * — * 

Given that P is sparse, it is easy to see that if T is an i- tree, then for all P,P' G T 
and j 7^ i we have 

LO Pi = Up^ 

or 

2u Pj n 2cjpj = 0. 

Secondly, we define trees in the context of bi-tiles, this time by using the "classical 
ordering" < c as in [2]. 

Definition 9.3. We define a partial ordering on the set of bi-tiles P by P < c P' if 
Ip Q Ip> and ujpi C ojp. A set T of bi-tiles is called a tree, if there is a tile Pt, the top of 
the tree, such that P < c Pt for all P G T. If j = 1,2, a tree is called j-tree if u)p T . C ijj P] 
for all P G T '. Notice that as before, we do not require the top to be an element of the 
tree. 

10. Tile norms 

We start the study of the form In this section we mostly recall some definitions 

and results from [10] and [11]. In the paper we shall be frequently estimating expressions 
of the form 



E wM«gi d3) 



PeP 



where P is a collection of tri-tiles and a^p. are complex numbers for P G P and j = 1, 2, 3. 
In some cases (e.g. if one only wished to treat the Bilinear Hilbert transform) we just 
have 



a 



U) 



(fi^Pi) (14) 



(j) 

In [9] the following (standard) norms on sequences of tiles were introduced: 



but we will have more sophisticated sequences ap 1 . when dealing with Ap ^ 
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Definition 10.1. Let P be a finite collection of tri-tiles, j = 1,2,3, and let (dPj) Pe p be 
a sequence of complex numbers. We define the size of this sequence by 




1/2 



where T ranges over all trees in P which are i-trees for some % ^ j. We also define the 
energy of the sequence by 

energy j((a Pj )p e p) := sup(J^ \a Pj \ 2 ) 1/2 

Dcli Pen 

where D ranges over all subsets ofP such that the tiles {Pj : P G D} are pairwise disjoint. 

The size measures the extent to which the sequence a Pj can concentrate on a single 
tree and should be thought of as a phase-space variant of the BMO norm. The energy is 
a phase-space variant of the L 2 norm. As the notation suggests, the number a Pj should 

be thought of as being associated with the tile Pj rather than the full tri-tile P. 

In the Walsh model the energy is a tractable quantity; for instance, if a Pj is given by 
(14) then one can control the energy by \\fj\\2 thanks to the perfect orthogonality of the 
Walsh wave packets. However, in the Fourier case the orthogonality is too poor to give a 
usable bound on the energy, and so we must instead use a more technical substitute. 

Definition 10.2. Let the notation be as in Definition 10.1. We define the modified energy 
of the sequence (a Pj )p € p by 

e^y 3 ((a P] ) PeP ) : = supsu P 2"(]T \I T \) 1 / 2 (15) 

nSZ T TeT 

— * 

where T ranges over all collections of strongly j- disjoint trees in P such that 

(^|ap 3 | 2 ) 1 /2> 2 ™|/ T | 1 /2 
Per 

for all TeT, and 

(J2 KI 2 ) 1/2 <2 n+1 |/T'| 1/2 

PGT' 

for all sub-trees T' C T e T. 

The reader may easily verify that the modified energy is always dominated by the 
energy, and that we have the monotonicity property 

energy^ {a Pj ) Pe p,) < energy^ (a Pj )p e $) 

whenever P' C P. 

The usual BMO norm can be written using an L 2 oscillation or an L l oscillation, and 
the two notions are equivalent thanks to the John-Nirenberg inequality. The analogous 
statement for size is 
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Lemma 10.3. Let P be a finite collection of tri-tiles, j = 1,2,3, and let (a Pj )p e p be a 
sequence of complex numbers. Then 

wzej{{a Pj )p e p) ~ sup r^lK^ I^/tft) 172 ^ 1 ' 00 ^) ( 16 ) 

TCP I Tl 1 31 



Per 



where T ranges over all trees in P which are i-trees for some % ^ j. Similarly, for 
1 < p < oo, one has 



wzej((a Pj )p eP ) ~ sup -— ^ || lap/ry 



2 -^ 7 p \ 1/2 I 



TcP r T 

PeT 

and t/ie implicit constants are allowed to depend on p. 



lp (i T ), 



Proof: The same as in [9], Lemma 4.2 . ■ 

The following estimate is standard, see [10], Proposition 6.5. This is the main combi- 
natorial tool needed to obtain estimates on (13). 

— * — * — * 

Proposition 10.4. Let P be a finite collection of tri-tiles, and for each P G P and 
j = 1, 2, 3 let afp* be a complex number. Then 

I E |7qi^ a S«S«Sl S n^iC^i)^)^^,^^)^) 1 ^ ( 17 ) 

Pep p i =1 
/or any < 1 ,9 2 ,9 3 < 1 win Oi + 9 2 + 63 = 1, ™^ implicit constant depending on 
the 9j. 

Note that this Proposition is stronger than that of the corresponding statement ([9], 
Proposition 4.3) for the unmodified energy. Of course, in order to use Proposition 10.4 
we will need some estimates on size and energy. 

The following Lemmas have been proven in [10]. 

Lemma 10.5. Let j = 1,2,3, fj be a function in L 2 (R), and let P be a finite collection 
of tri-tiles. Then we have 

^^.■(((/i.WW) ^ ll/ilh- (18) 
Lemma 10.6. Let j = 1,2,3, Ej be a set of finite measure, fj be a function in X{Ef), 

— * 

and let P be a finite collection of tri-tiles. Then we have 

size,(((/,,^.)) P ^)<sup^-^ (19) 

Per 3 I j pI 

for all M, with the implicit constant depending on M. 
We shall also frequently estimate expressions of the form 

|^a Pl 6 P2 | (20) 

PGP 

where P is a collection of bi-tiles, a Pl are complex numbers as before, while b P2 are complex 
numbers of the form 
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bp 2 = (Gx{x/N{x)eu>p 2 },<f>Pi)- (21) 

We need to define now "sizes" and "energies" for our bp 2 sequences. This time, they 
will no longer depend on the index "j" as before. 

Definition 10.7. Let P be a finite collection of bi-tiles, let P be the collection of all bi- 
tiles corresponding to our fixed dyadic grid (so I* is a finite subset ofP) and let (&p 2 )pep 
be a sequence of complex numbers of the form considered above. We define the size of this 
sequence by 

size((6p 2 ) PeP ) := sup sup rj^f \G\x{x/N(x)eu P ,}Xj p/ dx (22) 

P&P P><=P:P<cp> \ 1 P'\ JH 

where C is a fixed big constant. We will also need the "easy variant" of the size, defined 
by 

size e ((6p 2 )p eP ) := sup — / \G\x{ x /N{x)eu; P }X < i P dx. (23) 
pgp |Jp| JR 

Also, since we are in the Fourier setting, we define again a modified energy of this 
sequence by 

energy((6p 2 )p 6P ) := supsup2 n ( \ J P'\) ( 24 ) 



nGZ D p/gD 



where D ranges over all collections of disjoint bi-tiles P' G P with the property that there 
exists P G P with P < c P' and such that 



\ \G\x{x/N(x)eu P ,}Xi p , dx > T\I P i 
J 1R 



As before, energy((6p 2 )p eP ) should be understood as a technical substitute for the more 
natural (see [11]) "energy ((6p 2 )p eP )" defined by 

energy ( (6p 2 ) PeP ) := sup / \G\x{xiN{x)^ P ,}X C i pl dx, 

DCP p/ eD -/H 

where D ranges over all collections of disjoint bi-tiles P' for which there exists P G P 
with P < c P'. 

The main combinatorial tool needed to estimate (20) is the analogue of the above 
Proposition (10.4) for bp 2 sequences: 

Proposition 10.8. Let P be a finite collection of bi-tiles, and for each P G P let ap 1 and 

bp 2 be complex numbers as before. Then, 

I Yl ap ^\ ~ size 1 ((apJp) 01 size((6p 2 ) P ) e2 el^ 1 ((apJp) 1 - 01 el^y((6p 2 )p) 1 - e2 
pgp (25) 
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for any < Q\ < 1, < Qi < 1 with Q\ + 29 2 = 1. Moreover, if the bi-tiles of P 
are disjoint, then the inequality holds even if one replaces size((6p 2 )p) with the smaller 
quantity size e ((&p 2 )p). 

The proof of this Proposition will be presented later on. In the meantime, we will take 
it for granted. In order to use Proposition 10.8, we need again estimates on sizes and 
energies. 

Lemma 10.9. Let f e L 1 (R) and P be a finite collection of bi-tiles. Then, one has 

energy(((fX{x/N{x)eoj P2 },<f>Pi))pep) ^ ll/lli- (26) 

Proof: Fix n and D so that the suppremum is attained in the definition of energy. 
Then, we can write 

e^gy(((fx{x/N(x)eujp 2 }APi))pep) ~ 2 n ( ^ \I P ,\) < ^ / \f\X{x/N(x)eu, P ,}X? pl dx 

P'eD p'eu^^ 

and this is smaller than \\f\\i by using Proposition 3.1 in [6]. ■ 
Lemma 10.10. Let E be a set of finite measure and f G X{E). Then, 

size(((/x W jv(x)e^ 2 },0Pi»P6p) ^ sup sup E p ' (27) 

PeP p'eP:P< c P' \ 1 P'\ 

and similarly, 

size e (({fx{x/N(x)eu, P2 }, 0Pi))p e p) ^ sup f p . (28) 

PeP \-lp\ 

for every M < C . 

The proof follows directly from definitions. In the next section we shall show how 
the above size and energy estimates can be combined with Proposition 10.8 and the 
interpolation theory of the previous section to obtain Theorem 1.1. To prove the estimates 
for the form A' - we need some more sophisticated size and energy estimates, which we 
will pursue after the proof of Theorem 1.1. 



11. Proof of Theorem 1.1 

We now give a proof of Theorem 1.1. We present its proof here for expository purposes, 
and also because we shall need Theorem 1.1 to prove the size and energy estimates needed 
for Theorem 3.5. 

First, we linearize the Carleson operator as 

Cf(x) = [ f(0e 2m M 

J£<N(x) 

and then we dualize it into the bilinear form defined by 
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Ac(/i,/ 2 ) := [ C(f 1 )(x)f 2 (x)dx. 

By standard discretization arguments as in the previous sections, we may reduce the study 
of Ac to the study of discretized operators of the form 

Ac,p(/i,/ 2 ) := (C P (/i),/ 2 > 

= ^ (/l , <f>Pi ) {f2X{x/N(x)€u> Po },<l>Pi), 

PeP 

where P is some finite collection of bi-tiles. 

We shall use the notation of Section 3, with the obvious modification for bilinear forms 
as opposed to trilinear forms. 

Let us also consider Ei, E 2 sets of finite measure and 1 < q < 2. We are going to prove 
directly that there exists a major subset E' 2 of E 2 so that 

|A^(A J 2 )| < |£i| 1/9 |£2| W , (29) 

for every f\ G X(Ei), f 2 G X{E' 2 ) and also that there exists a major subset E[ of E 1 so 
that 

|Ac,p(/i,/ 2 )|<|^2|, (30) 

for every fi G X(E[), f 2 G X(E 2 ). (The reader will notice that a similar argument can 
be used to prove that there exists a major set E[ of E 1 so that \A C: r>(fi : f 2 )\ < II/2II1, 
for every fi G X(E[) and f 2 G L X (R). This is actually equivalent to the fact that the 
adjoint C* of the linearized Carleson operator is of weak type (1, 1); thus, it differs at this 
endpoint from the Carleson operator, which is not of weak type (1,1), see [2]). 

Then, by using the interpolation arguments in [8], it follows that the form Ac,p is of 
restricted type a, for every a in the interior of the segment defined by the endpoints (0, 1) 
and (1,0). Finally, Theorem 1.1 is implied by the classical Marcinkiewicz interpolation 
theorem. 

It thus remains to prove (29) and (30). 

To prove (29), we may assume by scaling invariance, that \E 2 \ = 1. Define the excep- 
tional set 

2 

n := \J{x/M X e 3 > C|^|}, 
i=i 

where M is the Hardy-Littlewood maximal function. By the classical Hardy-Littlewood 
inequality, we have \Q\ < 1/2 if C is big enough. Thus, if we set E' 2 := E 2 \ fi, then E' 2 is 
a major subset of E 2 . Let now fi G X(E 1 ) and f 2 G X(E 2 ). We need to show that 

|5> Pi 6p 2 | <|£i] 1/9 , (31) 
PeP 

where we denoted by 
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a Pl := (/i,0 Pl ) 

&p 2 := (f2X{x/N(x)e<^P 2 }, ( t ) Pi)- 



We shall make the assumption that 

x + dist(/ P ,n c ) ^ 2fc 
\ip\ 

for all PeP, for some k > independent of P and prove (29) with an additional factor 
of 2~ tk on the right hand side (for some e > 0). If we can prove (29) in this special case 
with the indicated gain, then the general case in (29) follows by summing in k. 
By the definition of Q we thus have 



f El xi P <2k , E , 



\Ip\ 

for all remaining bi-tiles P G P. From Lemma 10.6 we thus have 

sizei((a Pl ) PeP ) < 2 fe |£ 1 |. 
Also, from Lemma 10.5 and the fact that f\ G X(Ei) we have 

enCTgy 1 ((a Pl )p eP ) < |Pi| 1/2 . 

Similarly, for < k < 5 by usind the definition of Q and by applying Lemma 10.10 we 
have 

size((6p 2 ) PeP ) < 1. 

For k > 5, we observe that the corresponding bi-tiles are essentially disjoint and the same 
Lemma 10.10 gives the estimate 

sfze e ((&p 2 )p eP ) < 2~ Mk , 
for any M > 0. On the other hand, we have from Lemma 10.9 

energy ((6p 2 ) PeP ) < 1. 
From Proposition 10.8 we thus have 

\J2 a pM\ £ (2 fc |P 1 |) f?1 (2" Mfc ) f?2 |Pi| (1 ^ l)/2 = 2- k ^ Me ^\E 1 \ (1+e ^/ 2 , 
PeP 

for every 9i G (0, 1). If we chose now 6\ so that (1 + #i)/2 = 1/q and M big enough, this 
proves (29). 

To prove (30), we assume again without loss of generality that \Ei\ = 1, and define E[ 
similarly. Then, as before, we restrict the summation over those P having the property 
that 

1 + di St (/ P ,^ 2fc 

\Ip\ 

This time, we get the bounds 
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size 1 ((a Pl )p e p)<2- Mfe 
energy! ((a Pl ) Pe p) < 1 
size((6p 2 )p e p) 
energy((6p 2 )p gP ) < \E 2 \ 
and finally, by applying Proposition 10.8, we obtain 

I Yl ap ^\ ~ (2- Mfe ) 9l (2 fc |^2|) 92 |^ 2 r^ 2 = 2- k ^ M9 ^\E 2 \ 
PeP 

which again completes the proof, if M is a big constant. 

12. Estimates for A' =. 

— » — > 

We now continue the study of the form A -.. Fix P, Q and drop any indices P and Q 

for notational convenience. 

We also drop the ~ ' s in the definition of A' p( -, for simplicity. 

In the expression A' the Q tile in the inner summation has a narrower frequency interval, 
and hence a wider spatial interval, than the P tile in the outer summation. Thus the inner 
summation has a poorer spatial localization than the outer sum. It shall be convenient 
to reverse the order of summation so that the inner summation is instead more strongly 
localized spatially than the outer summation. Specifically, we rewrite A' as 



K,aish(fuh> fa) - |/.|i/2 a S a S a S 



QeQ 



where 



a S : = (/i,0Qi> 

a Q2 ■= (/2,0Q 2 ) (34) 

a S := (/3XWAT(aOeu,pJ,0Pi)(0Pi,0Q 3 >- 
PeP ; u>q 3 Cwpj 

If P' is an arbitrary subset of bi-tiles, we also define Cp, to be the operator given by 



C P'U) ■= ^(fX{x/N{x)eu P2 }Api)<t>Pi- (35) 
PeP' 

This operator is the adjoint of the Carleson operator and is therefore bounded on every 
W space for 1 < p < oo (see Theorem 1.1). 

To estimate our form A', we need analogues of Lemma 10.5 and Lemma 10.6 for a^. 
One crucial new ingredient in doing so shall be the following simple geometric lemma 
which allows us to decouple the P and Q variables. 

Lemma 12.1. Let i ^ 3 and let T C Q be an i-tree of tri-tiles. For any Q G T we denote 
by P^ the set 
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Similarly, we define the larger set P T by 

P T := {P G P : u) Q . A C u Pl ,for some Q G T}. 

— * 

Then, for any Q G T and any function f , we have the equality: 

(C* P ^f),M = (C* PT (f),<P Q3 )- (36) 
Proof: Fix Q G Q and note that the left hand side of (36) equals 

^ (fX{x/N{x)eu>p 2 },^Pi)((/>P 1 ,^Q 3 ), 

while the right hand side of (36) equals 



^ (fX{x/N{x)eu>p 2 },(f>Pi)( ( f ) PiAQ3)- 
PeP T 

Clearly, the sum on the right hand side contains more terms than the sum on the left 
hand side. Let us now take a look at one potential non-zero term on the right hand side 
which does not appear on the left hand side (our claim is that such terms do not exist !). 

It corresponds to a bi-tile P G Pt so that {4> Pl , 4>q s ) ^ 0. By Plancherel, it follows 
that ^ > ujp 1 fl j^q 3 7^ and in particular this implies that \ui Pl \ < 10|u;q 3 | (if not, than we 
would have 10|c<Jq 3 | < \oJp^ and so uq 3 C u>p 1 , which contradicts that the corresponding 
term does not appear on the left hand side). 

Since this bi-tile P belongs to Pt, it follows that there exists Q' G T such that ujq^ C lo Pi 
and this means that up 1 intersects both ojq^ and ojq 3 and also that 

\uq> 3 \ < \u Pl \ < 10\wq 3 \. 
But these two facts contradict that T is an i-tree (i ^ 3) and the sparseness of our 

— * 

collection of tri-tiles Q. The claim follows. 

■ 

The energy estimate is given by the following lemma: 

Lemma 12.2. Let E 3 be a set of finite measure and f 3 be a function in X(E 3 ). Then we 
have 

e-^rgy 3 ((«g)Q 6 Q) S l^| 1/2 - (37) 

Proof: Let n, T be an extremizer in the Definition 10.2 of energy^aQ^)^^). By 
duality, there exists a sequence (cq 3 )^ of complex numbers so that 

CTCTgy 3 ((ag]) <56 Q) ~ J2 a S c Q 3 - 

Moreover (see for instance Lemma 6.3 in [10]), the sequence (cq 3 )^ has the property that 
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Qef LreT |Jt| 

for all T C T G T. Then, the new expression of the energy becomes 

Yl Yl (/3XW7V(x)e^p 2 },0P 1 )(0P 1 ,0Q 3 ) C Q3 

QeT PeP;wQ 3 Cw Pl 

:= ^ bp 2 a Pl . 

PGP 

By using Lemma 10.8 in the particular case 9\ — 0, 62 — 1/2 the above expression can be 
majorized by 

size((6pJp) 1/2 energy 1 ((a Pl )p)energy((6p 2 )p) 1/2 . (39) 
Using the previous Lemmas and also Corollary 8.4 in [10], we obtain the estimates 

energy! ((apjp 6 p) < 1 

size((6p 2 )p eP ) < 1 (40) 
energy ((6p 2 )p eP ) < \E 3 \. 

Using them in (39) we end up with I-E3I 1 / 2 as desired. 

■ 

We shall also need the following estimate. It is a local version of the Carleson theorem, 
Theorem 1.1. 

— * 

Lemma 12.3. Let i 7^ 3 and let T C Q be an i-tree of tri-tiles. Let also e > 0. Using the 
same notations as in Lemma 12.1, the following inequality holds. 

' r \ V(i+0 / r \ V(i+0 

J u \Cp T (f)\ 1+£ x¥ T dx) <{j E ^ dx ) 

for any f G X(E) and any big M . 

Proof: To prove our inequality, it is easy to see that it is enough to show that for every 
interval ICR with \I\ = \It\ one has 

/ r \ i/(i+0 

\\C* P M)\\ L ^ {I) Z{J E Xi T dx) • (41) 

Then, we estimate the left hand side of (41) by 
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^ (fX{x/N(x)€u 1 p 2 },^P 1 )<Pp 1 

/ P n(3/) c ^0 



Li+«(J) 



+ 



^ (fX{x/N(x)e^p 2 }APi) ( l ) Pi 
/ P n(3/) c =0 



:= A + B. 



Li+«(J) 



We estimate term A by 



E 

fc=0 



^ (fX{x/N(x)e^P 2 },<l>Pi)<l>Pi 

/ P n(37)^0;|7p|~2-*|/ T | 



Li+«(J) 



< 



E 

fc=0 



E 



(l/l,X/ P ) 



/ P n(3/) c ^0 ; |/ P |~2- fe |/ T | 



Li+«(J) 



oo 

5E 

fc=0 \/ P n(3/) c ^0;|/ P |~2-'=|7 T | 



E w — m 



V(i+6) 



oo ✓ „ v l/(l+e) / /• \ 



< 



1/(1+^) 



To estimate term B we have two cases. First, assume that supp(/) C (5/) c . Then we can 
estimate it by 



< 



E 

k=0 



E 



(l/l,X/ P ) 



/ P n(3/) c =0;|/ P |~2- fe |/ T | 



Li+«(J) 



oo 

<E 



E 



fc=0 \/ P n(3i") c =0;|/ P |~2- fe |/ T | 



„ W dist(/ P ,(5/) C ) V 

X/ P J ( j-p-j J 

Up 



1/(1+6) / /• \ V(i + 6) 

< I / xidx 



1/(1 + 6) 



If, on the other hand, supp(/) C 5/ then the corresponding term B is smaller than 

1/(1+6) 



1+e ~ 



Xidx 



Je 



just by using the fact that the operator Cp maps L 1+t into itself. 



The analogue of Lemma 10.6 is 
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Lemma 12.4. Let e > 0, E 3 be a set of finite measure and f 3 be a function in X{E 3 ) 
Then we have 



size 3 ((ag) Q - e(3 ) < sup(-^ / Xi Q ) m+e) (42) 
Of=n \ 1 Q\ Je* 



) G Q l J Ql J E 3 

Proof: By Lemma 10.3 it suffices to show that 

1 „ NT^ , (3) |2 XJga/2 r - ( 1 



V(l+6) 



for some i ^ 3 and some i-tree T. Using Lemma 12.1, the left hand side of (43) equals 

4tII(E l^p r (/3),0Q3)l 2 ^) 1/2 llL^a T) . 

By using Lemma 10.6 and Lemma 12.3 this expression can be majorized by 



y R ic; T (/ 3 )ix^< ^— y R i^ T (/ 3 )r +e x^j s 



and this ends the proof. 

■ 

13. PROOF OF THEOREM 3.5 

— * 

We can now present the proof of Theorem 3.5. Fix the collections P and Q. We 
first show that A' - is of restricted weak type o for all admissible 3-tuples (0:1,0:2, ^3) 
arbitrarily close to A 2 , A 3 , so that the bad index is 3. 

Fix o as above and let Ei, E 2 , E 3 be sets of finite measure. 

By scaling invariance we may assume that = 1. We need to find a major subset 
E 3 of E 3 such that 

|Ap,Q(/l,/2,/ 3 )| <\E\° 

for all functions G X(Efi, i = 1,2,3. 
Define the exceptional set Q by 

3 

n-.= \J{M XEj > C\Ej\} 

3=1 

where M is the dyadic Hardy-Littlewood maximal function. By the classical Hardy- 
Littlewood inequality, we have |Q|<l/2ifCisa sufficiently large constant. Thus if we 
set E' 3 := E 3 \ Q, then E 3 is a major subset of E 3 . 

Let than /j e X(E' j ) for i — 1,2, 3. We need to show that 

I V 1 r7 (1 W 3) l < \ F\ a (AA\ 



QeQ 

,0") 



where is defined by (34). 

We shall make the assumption that 
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1 , dist(I^) 

m — ~ 

for all Q G Q, for some k > independent of Q and prove (44) with an additional factor 
of 2~ ek on the right hand side (for some e > 0). If we can prove (44) in this special case 
with the indicated gain, then the general case in (44) follows by summing in k. 
By the definition of Q we thus have 

4jf xi<,<2 k m 

\ 1 Q\ JEj 

for j = 1, 2. From Lemma 10.6 we thus have 

size j ((ag) Q ^)<2 fe |^.| 
for j = 1,2. On the other hand, by Lemma 12.4 one has 

**s((«g)*a) £ 2 ~ Mk 

since I-E3I = 1, for any big constant M > 0. Also, from Lemma 10.5, Lemma 12.2 and 
the fact that fj G X{E'-) we have 

energy^ag)^) < |£// 2 , 

for j = 1, 2, 3. From Proposition 10.4 we thus have 

2 



1 



11/2 " , Qi" , Q 2 



QeQ Q V7=i 

for any < 61, O2 < 1 such that there exists < 6% < 1 with 0\ + #2 + $3 — 1- The claim 
then follows by choosing #1 := 2«i — 1, #2 : = 2«2 — 1 and M big enough; note that there 
exist choices of a arbitrarily close to A 2 or A 3 , for which the constraints on 9i,9 2 ,0 3 are 
satisfied. 

To prove the restricted type estimates for a arbitrarily close to A 4 , A 5 , A 6 , A 1: one 
argues in the same way, by taking advantage of the fact that e in Lemma 12.4 can be 
arbitrarily small. 

This concludes the proof of Theorem 3.5. 

14. Estimates for A p q 

In this section we begin the study of A p q. As before, fix P, Q and drop any indices P 
and Q for notational convenience. We also drop the ' ' s in the definition of Apq, for 
simplicity. 

Also, as in the previous sections, it is more convenient to rewrite A" as 

A ,, (/ 1 ,/2,/ 3 ) = 

^ (f2, 0Qi) ((pQiX{x/N(x)e^ Qr ,} ^ (fl,<f>Pi)<f>PiX{x/N(x)eu>p 2 }, f3)- 

QeQ Lu Q2 nujp 2 ^®;\ujQ 2 \<\ujp 2 \ (45) 

Expressions of this type have been considered before (see Section 10), but this time, the 
presence of the inner sum makes their study much more delicate. 
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Let us now fix E X ,E 2 , E 3 arbitrary sets of finite measure so that |-E 3 | = 1. Define 

2 

n:={J{M XEj >C\E j \} 
j=i 

for a large constant C, and set E' 3 := E 3 \ fl Clearly, \E' 3 \ > 1/2 if C is a big enough 
constant. Pick now fi G X(E 1 ), f 2 G X(E 2 ) and f 3 G X(E' 3 ). 

The main combinatorial tool needed to estimate our form A" is the analogue of the 
above Proposition 10.8. 

Proposition 14.1. Let e > be a small number, and fi, f 2 , f 3 as above. Let also Q be a 
finite collection of bi-tiles. Then, 

|A"(/i,/2,/ 3 )| < size 1 (((/ 2 ,0Q 1 )) Q ) 9l size(((/3X{^) e . Q },0Q 1 ))Q) f?2 • (46) 
•enCTgy 1 (((/ 2 , (j>Q l ))Q) 1 ~ ei ^^fiS.{hX{x:N{x)e UQ h 0q 1 ))q) 1 ^ 2 - 

• sup( ) +\E 1 \ a 

Q \1q\ 

for any O<0i<l, O<0 2 <1 — e with 9i + 29 2 = 1 - 2e and < a < 1 - e. Moreover, 
if the bi-tiles in Q are disjoint, then the above inequality holds even when one replaces 
size(((/ 3 X{z : Ar(z)e^ Q },0Qi))Q) by the smaller quantity size e (((f 3 X{ x: N(x)euj Q }, 4>Qi))q)- 

The proof of this Proposition will be presented later on. In the meantime, we will take 
it for granted. 

In the next section we shall show how the above size and energy estimates can be 
combined with Proposition 14.1 and the interpolation theory, to obtain Theorem 3.6. 

15. Proof of Theorem 3.6 

Let P = (Pi, (3 2) f3 3 ) be an admissible tuple, very close to either of the points M 12 or 
M 56 or M 34 or A. 

Let us also fix Ei,E 2 ,E 3 arbitrary sets of finite measure and assume without loss of 
generality that \E 3 \ = 1. 
As usual, we define 

2 

n := \J{M XEj > C\E 3 \} 

j=l 

for a large constant C, and set E' 3 := E 3 \ f2. We now fix fi G X(Ej) for % — 1,2 and 
f 3 G X(E' 3 ). Our task is then to show that the following inequality 

|A w (/i,/ 2 ,/3)|<|^|^ (47) 

holds, for any Pi and P 2 arbitrarily inside the interval (0, 1). Clearly, this would complete 
the proof. 

As before, we may restrict the collection Q to those bi-tiles Q for which 
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1 | dist(/Q, £2 c ) _ nk 



f-NJ 



for some k > independent of Q and prove (47) with an additional factor of 2~ Xk on 
the right hand side (for some A > 0). If we can prove (47) in this special case with the 
indicated gain, then the general case in (47) follows by summing in k. 
This implies that 

^^<min(2 fc |^|,l)<(2 fe |^|r 

I-'qI 

for all these tiles Q G Q, < s < 1 and j = 1,2. We also have 

l r l ~ 

for any big number M. We also observe that for k > 5 the corresponding bi-tiles are 
essentially disjoint. Using all of these and from Lemma 10.6, Lemma 10.5, we thus have 

size 1 (((f 2 ,cf> Ql )) Q )<(2 k \E 2 \y 

en^riy 1 (((/ 2 ,0 Ql )) Q )<| J E 2 | 1 / 2 

s™e(((hX{x:N(x)eu Q },M)Q) < 1 

size e (((/ 3 x { x:iv(x) e ^},0Q 1 ))Q) < 2~ Mk 

^gy(((hX{x-.N{x)^ Q }iM)Q) < i- 

By Proposition 14.1 we thus can bound the left-hand side of (47) by 

(2 k \E 2 \) sei (2~ Mk ) e2 \E 2 \ ( - 1 - ei)/2 (((2 k \E 1 \) a + l^n = 2- Xk \E 1 \ a \E 2 \ sei - 8l/2+1/2 , 

where 9\, 9 2 , a are as in Proposition 14.1 and A is a number depending on them. Clearly, 
a can be chosen arbitrarily inside the interval (0, 1) if the e in Proposition 14.1 is small 
enough. Similarly, the exponent s9\ — #i/2 + l/2 can be chosen arbitrarily close to (if 
6>! is close to 1 and s is close to 0) and also arbitrarily close to 1 (if 9\ is close to 1 and 
s is close to 1) and this finishes the proof, since A can always be made positive if M is 
chosen big enough. 

16. The tree estimate 

We first recall the following crucial Lemma which is a particular case of Proposition 
3.6 in [13]. 

Lemma 16.1. Let T be a 2-tree of bi-tiles and let f,g be two arbitrary functions. Then, 
^2\{f,M(9,M\ Z size 1 (((f,<f)p 1 ))p) 1 - ei size 1 (((g,<f) Pl )) P ) 1 - e2 - 

•energy(((/, Pl )) P ) 9l e'5ergy(((^, <p Pl )) P ) e2 
for any < 9±, 9 2 < 1, 9± + 9 2 — 1 where 
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energy (((/,0 Pl )) P ) := sup 



DCT 



(\flxD 



E WJ h A. 



1,00 



where D ranges over all subsets ofT so that the intervals {Ip : P e D} are disjoint. 

Fix now the collections of bi-tiles P and Q. We begin our study of the form A p q by 
considering the contribution of a single tree (the reader should recall the definition of the 
form A p q given in Section 14 ). 

Lemma 16.2. Let e > a small number, let T be a tree in Q and /i,/2,/3 be three 
functions as in Proposition I4.I. Then, the following estimate holds: 



|Ap,T(/l,/2,/ 3 )| 



< 



(48) 



< 



sizei(((/ 2 , M)QeT)size(((f 3 X{x:N(x)eu, Q2 }, 0q 1 ))q s t) £ |/t 



BU p(^^»)i-« + | jE 7 1 |« 
^ MO I 

/or any < a < 1 — e. Moreover, if the bi-tiles in Q are disjoint, then the expression 

size(((f 3 X{x:N(x)€iV Q2 } , 0Q X ) )q G T ) 

can be replaced by the smaller quantity 

SlZe e ( ( {fzX{x:N(x)&LOQ 2 } , 0Qi ) )qgt) • 

Proof: Let J7" be the collection of all maximal intervals J inside our fixed dyadic grid 
such that 3J does not contain any Iq with Q e T. Then, clearly, ^ is a partition of the 
real line R. The left hand side of (48) can be written as 



|Ap,T(/l,/2,/3)| 



E 

QeT 



(f2,(pQ 1 )( ( l ) Q 1 X{x/N(x)eu> Q2 } ^2 (fl,<f>Pi)<t>PiX{x/N{x)eup 2 },f3) 

U)Q 2 C\U)p 2 jt%\\wQ 2 \<\u)p 2 \ 



y^(/2, <l>Q 1 ){(l>Q 1 X{x/N(x)eu> Qo }CQ(fl), / 3 ) 



( ^(f2,<l>Qi)<l>QiX{x/N(x)eu,Q 2 }CQ(fi)f3 J 



JeJ^ \QeT 



< 



/ S {f^ < t>Qi) < t>QxX{x/N{x)^ Q2 }C Q {f 1 )h \{x)dx 

1 CL T J J \ n^rpA r _ I I Tl / 



Jej""' \QGT:|/ Q |<|J| 
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E 



/ (h^M ( f ) QiX{x/N(x)eu> Q2 }CQ(fi)f3 \(x)dx 

Jj \QeT:\I Q \>\J\ J 



:=/ + //. 

We first estimate term I. Fix J e J and Q G T with \Iq\ < \J\. We claim that 



< 



(/2, <f>Q 1 )<l)Q 1 X{x/N(x)e^Q 2 }CQ(fi)f3) (x)dx 
sizei(((/ 2 , 0Q 1 ))Q e T)size(((/3X{x:JV(x)e^}, 0Qi))oeT) 1_e - 



sup^ 



-r e + l£l 



for any big constant m. Assume that (49) holds. We also have 

y A + diBt(/ Q .j) y" |Jg|<2t / 1 + diB t (/ r ,j) 

and after summing over k with 2 k < | J| and over J <E J this gives the bound 

dist(ir, J)' 



JeJ 



\It\ 



< \It\. 



(49) 



This, together with (49) gives the desired estimate. Thus, it remains to prove (49). 

We first observe that in order to estimate (49) it is enough to estimate expressions of 
the form 



(f2, <!>Qi)<f>QiX{x/N(x)eu, Q2 }(^2 <M)<M)./3Gb 



Per' 



(50) 



where P runs inside a 2-tree T' so that all the |/p|' s are smaller than \Iq\. Now we split 
the tiles P in T' as T" = IJd>o where contains all the tiles having the property that 

d i3 t(/p,tm 



I AT 



We also decompose the interval J as J = (Ji=i ^ where the Jj' s are disjoint intervals so 
that | Jj| = \Iq\ for every « = 1, ...,N. The expression in (50) can be majorized by 



N 



EE 

d>0 i=l 



(/2, 0Qi)0QiXWJV(*)eu, Q2 }( ^ (/i> <PPi)<PPi)hdx 



PeT' d 



Fix now « = 1, ...,N and d > and look at the corresponding expression. We decompose 
it again into 
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+ 



/ (f2,(f>Q 1 )<l>Q 1 X{x/N{x)ewQ 2 }{ ^2 (fi'M^P^hdz 
Ji PgT , 



(f2,<j)Q 1 )(j)Q 1 X{x/N(x)£u> Q2 }( (fl'M^^hdz 



:=A + B 

where T' dl contains those tiles P so that Ip C 2Jj while T' d2 contains those tiles so that 
Ip £ 2J-. 

We now concentrate on B. It can be majorized by 



sizei(((/ 2 , Qi ))q g t)(sup —± 

p u 



R 



X? P )Xi Q Xn-XJihdx 



p^ 2 



< sizei(((/ 2 ,</>Q 1 ))Q G T 



)(su P ^)2-^fi + dist , (/Q ;^Y 

p \ I P\ \ \Iq\ J 



•^{{{hX{x:N(x)eu JQ }AQ l ))QeT)\lQ\- 

Now, after summing over i — 1, N we obtain the bound 

8*ei(«/2, Ql > W)(sup %^)2-^ (l + ^pl^l) . 

p I-'pI V MqI / 

•size(((/3X{x:iV(x)e«Q},^0i))06T)|i'Q|- 

Since we also know that P G 7^, we have in particular that 

(sup < min(2 d |E 1 |, 1) < 2^1", 

p \-ip\ 

for every a E (0,1). Also, since size(((f 3 X{x-.N(x)^ Q }, 0Qi))qgt) < 1 it follows that 
to&>(((fsX{x:N{x)&j Q }, M)q&t) < size(((f 3 X{x-.N(x)eu J Q}, 0Qi))QeT) 1_e and as a consequence, 
after summing over d > 0, the new bound is the desired one. 
We now concentrate on A. It can be majorized by 



size i ( ( {f 2 , <f> Ql ) ) Qer) ^ KA > ) I K/sfe JV(x)e^ Q2 }X J* Xf^ c , <M I 
Pe^i 



(51) 



where <pg 1 := (pg^Ig] 1 ^ 2 is an L°° normalized bump addapted to the interval Ig. We want 
to apply Lemma 16.1 to handle this sum. As before, since P e T' d we have the estimate 



sizei(((/i,0 Pl )) P ) < sup 

p \ 1 p\ 



<mm(2 d \E l \,l)<2 ad \E 1 \ a . 



Also, 
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sizei 



(«/s0q 1 X{*:N(x) & ^}Xj,Xoc,^»p) < 2- md (l + dlst ^ J ^ 



To estimate energy(((/i, 0p 1 ))p) fix a set D as in Lemma 16.1 so that the supremum is 
attained. Since all the Jp' s are inside 2Jj, we can write 



energy(((/i,0 Pl )) P ) 



PeD 





lXi P ) 




ip 





Xh 



1,00 



< 



{\fi\XJi,Xi P ) 



|7 



-Xh 



PeD 



< 



\m\h\u 



1,00 



< IH/--,K,,l|.l.,x ^ffe^)|/ol<|/o|. 



Finally, 



energy(((f 3 4) Ql X{ X :N(x)eu Q2 }XJiXnc,(f)Pi))p 

1 



< 



Xl Q X{x:N{x)£u Q }\h\dx 



Iq\ 



X/ Q X{x:JV(x)6a;Q}|/3Ma; |/q 



< 



By applying Lemma 16.1 we estimate (51) by 

size 1 (((/ 2 ,0 Ql )) OeT )(2^) 1 ^ [2- md 



1 + 



dist(I Q , Jj 



1-02 



• I J Q 1 61 (size ( ( (/3X{* : jv(*)eu, Qa } , 0Qi ) ) Qer ) | Iq | ) 02 , 

for any < 6\, 6 2 < 1 with 61 + 62 = 1. Now if we choose #2 = 1 — e, #1 = e and m big 
enough, we obtain again the desired bound after summing over d > and % — 1, ...,N. 
This ends the discussion on term I. 

We now estimate term II. First, we observe that the intervals J E J which contribute 
to the summation have the property that J C 3I T . We then split the tree T as T = T\ +T 2 
where 7\ is a 1-tree and T 2 is a 2-tree. As a consequence, our term II also splits as 



// = Ih + Ih- 

We first discuss term Hi. We also observe that our tree T' of P bi-tiles also splits as 
T' := T 2 ' + T[ where T[ is an 1-tree and T' 2 is a 2-tree and as before this implies a further 
decomposition of Ih as 

Ih = I ha + //lb- 

We concentrate on II la first. We can write it as 
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E / I E (f^^Qi) ( l ) QiX{x/N(x)eoj Q2 } E (fi,<f>Pi)<f>PiX{x/N(x)eup 2 }f3 J dx 

J Jj \\Iq\>\J\ " \uq 2 \<\up 2 \;\Ip\<\J\ ' J 

+ E / I E if 2 ^Qi) ( i ) QiX{x/N{x)eu> Q2 } E (fu<t>Pi)<t>PiX{x/N{x)eup 2 }h J dx 

J Jj \\Iq\>\J\ " K 2 |<|^p 2 |;|/p|>|J| " / 



e/, £ • ■ £ 

J JJ \\Io\>\J\ PeKilwooKlwful;! 



|/q|>|J| Pe^;|c Q2 |<|^p 2 |;|/p|<|J|;/pC2J 



+E / f E ••• E ••• 

J 3 \\Iq\>\A P&^ Q2 \<\u P2 W P \<\J\;I P pJ 



+E/ E ■■■ E • • I <<■>■ 

J 1 \l / 0l > l J l Pe^;|^Q 2 |<| W p 2 |;|/p|>|J|;/pC5/ T 



+£/,£■■■ £ -I'- 

J J \|Jgl>|J| PeT>;\u Q2 \<\iVp 2 \;\I P \>\J\;Ip£5I T 



— TP + jj" i /■/■'" i 77"" 

•— 11 la^ 11 la^ 11 la^ 11 la- 

We will treat them one by one. We start with II' la . Fix an interval J and look at the 
corresponding function under the integral. It is equal to 



X{x/AT(x)e^p 2 } hXJXnc 

k \Iq\>\J\ / \PeT>;\lp\<\J\;lpQ2J 

and this is pointwise smaller than 



sizei(((/ 2 ,0Q 1 ))QeT) I ^ (h,4>Pi)<t>PiX{x/N{x)eup 2 } fsXJXnc- 

\P£T>;\Ip\<\J\;IpC2J J 

On the other hand, let now J' be an interval in the dyadic grid which contains J and 
| J'| = 2\J\. By the maximality of J, it follows that 3J' contains an interval 7q, for some 
Q G Ti. Then, let Qj G Q be a tile with |/ Qj | = | J'| and so that Q < Qj < Qt- Clearly, 
the support of the above function is included inside the set {x : N(x) G ujq t }. As a 
consequence of these two facts, we can estimate the integral on J by 



sizei(((/ 2 ,0Q 1 ))Q eT )- 
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E 

d>0 



li 



^ (fl,(l>P 1 )<l>P 1 X{x/N(x)eu>p 2 } 
PeT^nP d ;\Ip\<\J\;I P C2J 



X{xeJ:N(x)eu Q Ahx^dx 



< 



sizei(((/ 2 ,0Q 1 ))Qgr) 



(52) 



■E 

d>0 



li 



E 



(/l, <t ) Pi)4>PiX{x/N(x)eu J p 2 } 



PeT£niP d ;\Ip\<\J\;I P C2J 



X{x:N(x)^ Qj }hXn-XQjdx 



Fix now d > 0. To estimate the above integral, it is clearly enough to estimate expressions 
of the form 



/ I E (fi,<f>Pi)<f>PiX{x/N(x)eu>p 2 } J X{x:N(x)eu lQj }f3Xn c X < Q J hdx, 

JH yp G ^ nPd .|/ p | < |j|./ p c2j 

where h G L 00 , ||/i||oo < 1- This can be further majorized by 



E 



(53) 



PeT^nP d ;|/p|<|J|;/ P C2J 



To estimate this last expression, we need to apply again Lemma 16.1, in the same manner 
as we did when we estimated (51). Thus, (53) can be majorized by 



2- Ad size(((/3X{^)^ Q },0Q 1 ))Q e T) 1 - £ | J Eir|/Q. 7 | 

= 2- Xd size(((f 3X {x^x)^ Q }AQ 1 ))QeT) 1 ~ e \E 1 \ a \J\, 

where A is a positive number and < a < 1 — e. Using this in (52), after summing over 
d and J, we obtain the desired bound. 

We now estimate II" a . Just by taking advantage of the decay coming from products of 
type " 4>q 1 • xj " > we can easily estimate it by 



Ad ^size 1 (((/ 2 ,0Q 1 ))Q eT )size(((/ 3 X{ :c: JV(x)^ Q },0Q 1 ))(3eT) 1 I^iTIQjI 

d>0 J 

< sizei(((/ 2 , ^Q^QeT^izeidf^Xix-.Nix^Q}, ^Q^QeT^E^l J\ 
J 

< size 1 (((f 2 ,(f)Q 1 })Q € T)size(((f 3 X{x:N(x)^ Q },(l)Qi})Q€T) 1 ~ e \E 1 \ a \lT\. 
To estimate II"' a fix again an interval J and look at the corresponding term under the 
integral. It is given by 



E (^2) 4 > Qi)4 > QiX{x/N(x)eLL)Q 2 } ^2 (fl,<f>Pi)<f>PiX{x/N{x)eup 2 }f3 

\Iq\>\J\ |«q 2 |<|w PSi |;|/p|>|J|;/pC57t 
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:=Hj. 

Fix now x G J. We first observe that since Q ranges inside a tree of type 1, all the sets 
of the form " {x/N(x) G ujq 2 } " are disjoint if the tiles involved have different scales. As 
a consequence, for our particular fixed x, there is only one Q-scale that contributes. Let 
us denote this unique scale (which depends on x) by L. As a consequence, one can write 
Hj{x) as 



E (foMfa p) E (/i-Wfe]W/3(4 

|7 |>|J|;|«g|=L / \\I P \>\J\;I P C5I T ;L<\u, P2 \ / (54) 

Since the intervals ujp 2 are nested, it follows that there is a largest (resp. smallest) interval 
u + (resp. of the form uj p such that the term in the middle of the product in (54) 
equals 



E 

IpC5It;\u-\<\ujp\<\u> + \ 



(fuMfa ) (x)= ( E </l,Wtfft) *(*+"*-)(*) 
/ \/pC57 T / 



where are well chosen bump functions so that |supp(\I/+)| ~ |c<j + | and |supp(\I/- 

In particular, this implies that this middle term is smaller than 



1 

sup — 

JCI |-f I J I 



E (friM&Pi d y 

I P C5I T 

which is a constant quantity on the interval J. As a consequence of thsese observations, 
our term II'" a can be estimated by 



sizei(((/ 2 , M)q) 



E (fi'Mfa 



I P C5I 7 



dy\[ hX{x/N{ x )eu Qj }dy 



< 



sizei(((/ 2 ,0 Ql )) Q )E f SU P^ / 



E (/i' < M)<M 

/ P C5/ T 



R 



Xi Q ,X{x/N{x)^ Q }fzdy 



< 



sizei(((/ 2 , (j) Ql )) Q )size(({f 3 X{x:N(x)euj Q }, M)q)- 







I sup 




uc/ 



E (fi'Mfa 

I P C5I T 



d y l J Qj 



< 



sizei(((/ 2 , 0Q 1 ))Q)size(((/ 3 x {a; :jv(a ; )e^Q} ) M)q)' 







I sup 




be; 



E (fi'M^Pi 

I P C5I T 



dy \J\ 
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< sizei(((/ 2 , M)Q)size(((f 3 X{x-.N(x)eu> Q }, M)q)- 



Pi 

I P C5I T 



dy Xu 



L!(3/ T ) 



< 



sizei(((/ 2 , <f)Q 1 })Q)size(((f 3 X{x:N(x)^ Q }, M)q)- 



I P C5I T 



L1(3/t) 



< 



sizei(((/ 2 , 0Q 1 ))Q)size(((/ 3 X{x:JV(x)ecQ}, M)q)- 



I P C5I T 



< 



sizei(((/ 2 , 0Q 1 ))Q)size(((/ 3 X{ ;C :7V( ;c )ec Q }, 0Qx))q) 



5^ (fuMfa 

I P C5I T 



\Ir\ e . 



Now we also have 



I P C5I T 



LV(i-<0 



X] (fuMfa (y)g(y)dy 



yI P C5I T 



for some (7 G ||<7||i/ e = 1. The last term is further smaller than 

E KA.#«>IIW»>I = if, E 

„r^c;r„ </JK, r i-r;r_ I -PI I -PI 



I P C5I T 



< 



■ I P C5I T 



< 



I/pI 



1/2 



1/2 



lLi/(i- £ ) ' 



\- l(^<M)l\ , . M 

— 171 — xip I Hi 1 



<— 11 (v '^^'V 



1/2 



| L l/(l-e) • (ItI 1 ^ ^ l-^Tl 1 ^ 



by using Lemma 10.6. Inserting this into (55) we obtain the desired bound. 
We now estimate II""- We first write it (as usual) as 
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d>0 

where the P-tiles inside " • • • " run inside the set T 2 fl P d . Then, fix d > and J and 
look at the corresponding integrand in II"". It is pointwise smaller than 



E 

fc>0 



E - E 

\I Q \>\J\;\I Q \~2- k \I T \ PeT^nP d ;|u,Q 2 |<|u,p 2 |;|/p|>|,/|;Jp£5/T 

Je, Xi r 



<2- rfm size 1 (((/ 2 ,0 Ql )) Q )(sup J -^L)J22- km hXncXjX {x -.N (x) ^ Qj} 

PeP. UpI 'r - ' 



fe>0 



< 2- dm siz ei (((/ 2 , 0Q 1 )) Q )2^| J E i r/3XjX { ^) e . gj }, 

for every < a < 1. Using this information, the integral over J is bounded by (after 
summing over <i > 0) 



sizei(((/ 2 , 0Q 1 )) Q )| J Bir y X{x:jv(x)e^ gj }/3(a;)^ 

< sizei(((/ 2 ,0o 1 ))g)size(((/ 3 X{x:JV(x)ea;Q 2 },0Oi))o)|-Siri^| 
and this, after summing over J & J, gives the desired bound. To finish the discussion on 
term JJi we need to discuss now term II lb . We first split it as 

E" 

as before. Since now both our trees T and V are 1-trees, it follows that the sets {x : 
N(x) G ujq 2 } and {x : N(x) G ujp 2 } are disjoint, if they correspond to different scales. 
As a consequence, for a fixed d > and J G J7", the corresponding integrand is pointwise 
smaller than 



sizei(((/ 2 ,0Q 1 ))o)( sup E \ P )2~ dm hxjX{x-.N{x)eu Ql } 

PGP d \1P\ 



<size 1 (((/ 2 ,0 Ql )) Q )2^| J E 1 | Q 2- dm /3XjX{, : iv(,) &Qj } 
for every < a < 1. Integrating this over J and summing over d > we obtain the 
bound 

size 1 (((/ 2 ,0Q 1 )) Q )size(((/ 3 X{ a; :jv( a; )G^ 2 },0Q 1 ))Q)l^iri- / 'l 
which, after summing over J G J, becomes the desired bound. 

It remains to estimate term II 2 . First of all, we write as before V = T' 2 + T[ where T[ 
is an 1-tree and T 2 is a 2-tree and this implies a decomposition of II 2 as 



Ih = I ha + Ihb- 

Furthermore, we split again the first term II 2a as 
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n 2a = n' 2a + ill + n'l + nZ 

where each of these terms correspond to the same summation constraints as before, when 
we decomposed the term ll\ a . 

We now estimate II' 2a . As usual we split it as J2 d>0 • • • ■ Then, we fix d > and J and 
look at the corresponding integrand. It can be written as 



^ (f2,4>Qi)<t>QiX{x/N{x)eu Q2 } ^2 t \hAPi)4>PiX{x/N{x)eu J p 2 )hX^XJ 



Mq\>\J\ 



< 



sup 

JCI 



JIp\<\J\;I P C2J 
5^(/2,^Oi)0Qi 

Q 



dy 



^ (fi, ( ! ) Pi) ( i ) PiX{x/N{x)eup 2 }hX£i c XJX{x/N(x)eu> Qj } 

\Ip\<\J\;I P C2J 

by using the same geometric arguments used to estimate term II 2a . Integrating over J 
this gives the bound 



sup-^ 

jci U 



Q 



<l!/)- Yl \(fiAp 1 )\\(f3Xn-hxjX{x/N(x)eu Qj },(t)Pi)\, 

\Ip\<\J\;I P C2J (56) 



for some h G L°°, 
16.1) by 



< I. The last sum can be estimated as before (using Lemma 



2 , lEir) i- e(2 - m , r|jr(size(((/3X{ ^ )g ^ };0Qi))Q)|J|) 



\ 1— e fc\—md 



1-e 



= 2 -Ad |Eir (l^) (size(((/3X{ ^ )g ^ }50Qi))Q)) l- V|? 

for some A > 0, if m is big enough. As a consequence, after summing over d > the term 



II 2a becomes smaller than 



l^l^ 1 e) {size{{(f 3X { X :N(x)^ Q2 }AQ 1 ))Q)) 1 £ 5Z( SUP r^ / 

j \JQI I I Jl 
and this, by an argument used before is smaller than 



53</2,^Qi)0Qi 



| J| 



e) (size(((/ 3 X{ ;C :7V( ;c )ec Q2 },0Q 1 ))Q)) 1 e sizei(((/ 2 , Ql )) Q )|/ T |. 
We now estimate II 2a . As before, we first we decompose it as J2d>o " ' • Then, we fix 
d > and J and look at the corresponding integrand. It can be written as 



^ (f2,<f>Qi)</>QiX{x/N{x)eu>Q 2 } J I ^ (fl,<PPi)(pPiX{ X /N(x)€iUp 2 }f3Xn-XJ 

\I Q \>\J\ J \\Ip\<\J\;Ipp.J /57) 
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~ I sup TTi 

jci \1 



~ I SU P I 7-1 

JCI Ul 



dy ( sup )2 /3XJX{x/JV(*)e«g 7 } 

/ pfPj Up j 



5^(/2,^Oi>0Oi 



J 2 dQ |E 1 | a 2 md /3XjXWAT(x)Gw Qj } 



for < a < 1. After we integrate over J we get the bound 



2~ Xd \E 1 \ a I sup 



jci U | J i 



Xl(/2,0Qi>0Qi 
Q 



ofa/ size(((/ 3 X{x : 7V(x)Go; Q2 },0Q 1 ))Q)|d 



Summing now over d and J we obtain the desired bound. 

We now estimate term 11^ as follows. Fix J G J . The corresponding integrand can 
be written as 



^ (f2,<pQi) ( l ) QiX{x/N(x)€^ Q2 } ^2 (fl,<f>Pi)<l>PiX{x/N(x)eu,p 2 }f3XJ- 

\Iq\>\J\ | WQ2 |<|o;p 2 ||/p|>|J|;7pC57 T (58) 

Using a similar analysis on the geometry of the frequency intervals, as in the case when 
we estimated term II"' a , the above term (58) is pointwise smaller than 



sup — 
jci u | J i 



n(^(/ 2 ,0 Ql )0Q 1 ,^(/i,0P 1 )0P 1 ) 



Q 



+ | sup — 

JCI \i I J I 



SU PT7T / 
jci \1 1 J I 



Pi 



5^</2,0Qi)^Qi 
Q 

where II is a paraproduct well adapted tp the frequency intervals of the trees T 2 and T' 2 . 
Notice that this expression is constant on the interval J. After integrating over J and 
summing over all the intervals in J we can bound term ll'£ a as before by 



^{{{hX{x:N(x)doj Q2 }, 0Qi))q) 



M(u(J2(f2, M<f> Ql ,J2(fi> MM) 



L\ZI T ) 



+size(((f 3 X{x:N(x)^ Q2 }, 0Qi))o) 



M(£(f2, MMMCEifu MM 

Q P 



£i(3/ T ) 



< 



^{{{hX{x:N{x)&w Q2 }, M)0) 



Ti(J2(f2,M<P Ql ,J2(^MM 



|/rl e/(1+e) 



L^+t{3I T ) 



+size(((f 3 X{x-.N(x)eu JQ2 }, M)q) 



^(f2,M<f>Qi 

Q 



^2(fuMM 



£1/(1-0 
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< 



size(({f 3 X{x:N(x)^ Q2 }, M)i 



|/rl e/(1+e) 



^2(f2,<j>Qi)<f>Qi 



£(i+OA 2 



£1/(1-0 



+Size(((/ 3 X{o : :Ar(a ; )6a;Q 2 }, 4>Qi))q) 



Q 



^{h,<t>Pi)<t>Pi 



£1/(1-0 



< 



size(((/ 3 X{x:JV(x)G^Q 2 },0Qi))Q)size 1 (((/ 2 ,0Q 1 ))Q)(— / xi 



\l-ei 



L T\ J Ex 

which is what we wanted. Term II^ can be estimated in a similar way with term II"£ 
and is left to the reader. It remains to estimate term II 2 b in order to finish the whole 
proof. We first split it as usual, as J2a>o " ' ■ We then fix d > and J and look at 
the corresponding integrand. Since P runs now inside a tree of type 1, the intervals 
{x : N(x) G ujp 2 } are disjoint if they correspond to different scales. In particular, for a 
fixed x G J there is only one scale that contributes to our summation. As a consequence, 
we can pointwise estimate our integrand by 





1 




sup 




/ 


\jci 





Q 



dy 



( lE 1 X-IP N _ m( i „ 

{ SU P m ) Z J3XJX{x/N(x)ew Q ,} 

PeP d \lp\ 



(59) 



and this, as we have already seen, leads to the right estimate. This ends the proof of our 
inequality. 

In the particular case when all the bi-tiles in Q are disjoint, every tree is an "one bi-tile 
tree" and the whole proof becomes much simpler. As a consequence, one easily observes 
that size[...] can be replaced by size e [...]. The proof is now complete. 



17. Combinatorial lemmas 

In order to prove Propositions 10.8 and 14.1 we need to recall certain standard com- 
binatorial Lemmas. To bootstrap the summation over a single tree T as in Lemma 16.2, 
to a summation over the whole Q, we would like to partition Q into trees T for which 
one has good control over J2 T \I T \. This will be accomplished by several decomposition 
lemmas. The first one appeared in [10] (see Proposition 12.2.). 

Proposition 17.1. Let j = 1,2,3, Q' be a subset of Q, n G Z, / be a function and 
suppose that 

size J (((/,0Q.)) Q6Q /) < 2- n ^F£j j ( y ( y (f,(j) Q] )) QeQ ). 
Then we may decompose Q' = Q" U Q w such that 

size,(((/,0 Qj )) QeQ ») < 2- n - 1 ^F£j J (((f, ( j )Q] )) QeQ ) (60) 
and that Q w can be written as the disjoint union of trees T such that 

E IJH < 2 2 ". (61) 
TeT 

By iterating this proposition one obtains (see again [10], Corollary 12.3). 
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Corollary 17.2. There exists a partition 

Q= |J Qn 

where for each n G Z and j = 1, 2, 3 we have 

sizej(((/, 0Q 3 ))Q eQ J < min(2- n e'Se?gy i (((/, <j> Qj )) Q eQ), sizej(((/, 0q,))q 6 q)). 
t4/so 7 we may cower Q„ fry a collection T n of trees such that 

E s 22ri - 

T£T„ 

The next Proposition together with its corollary are also known (see [6], Proposition 
3.1). 

Proposition 17.3. Let Q' be a subset of Q, n G Z, f be a function and suppose that 

size(((fx{x/N(x)ew Q2 }, M)QeQ') < ^~ n ^gy(((fX{x/N(x)^ Q2 }, M)qeq)- 
Then we may decompose Q' = Q" U Q w such that 

size(((fx{ x /N(x)eu Q2 }, M)geQ") < ^~ n ~ 1 e^gy(((fX{x/N(x)eu> Q2 }, 0Qi»Qeq) 

(62) 

and that Q'" can be written as the disjoint union of trees T such that 

5>t|<2". (63) 
TeT 

Moreover, if all the bi-tiles in Q are disjoint, the Proposition holds if one replaces size[...] 
by size e [...]. 

Corollary 17.4. There exists a partition 

Q=\jQn 

n€Z 

where for each n G Z we have 

size(((fX{x/N(x)eu Q2 }, 0Qi»QeQ„) < 

min(2- n e'5ergy(((/xW7V(x)G^ Q2 }, Mheo), s™e(((fX{x/N( x )eu Q2 }, Mheo))- 
Also, we may cover Q n by a collection T n of trees such that 

E 1^1 < 2". 
TeT n 
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18. Proof of Propositions 10.8 and 14.1 



It remains to present the proofs of Propositions 10.8 and 14.1. We start with Proposition 
10.8. Fix Q a collection of bi-tiles and let clq 1 and bq 2 be complex numbers as before, 
given by 



and 



a Qi ■= (/i,0Qi) 

: (f2X{x/N(x)eu>Q 2 }, 



Fix also #i, 9 2 G (0, 1) so that 9i + 29 2 = 1. First of all, let us recall the following standard 
estimate (see 



QeT 



< 



sizei((aQ 1 )( 3eT )size((6Q 2 )Q GT )|/ T | 



(64) 



for any tree T in Q. During the proof, we will write for simplicity S± := sizei((ag 1 )Q e Q), 
S 2 := size((6 Q2 ) QeQ ), E 1 := energy! ((a Ql ) QeQ ) and E 2 := energy ((b Q2 ) QeQ ). 

If we apply Corollaries 17.2 and 17.4 to the functions and respectively, we obtain 
decompositions 

Q = IJ Qn 

n€Z 

for j = 1,2 such that each Q{ can be written as a union of subsets in T{ satisfying the 
properties of those Corollaries for j = 1, 2. In particular, we can write the left hand side 
of our desired inequality as 



eie 2 J2 E E a <?^ ( 65 ) 

ni,n 2 T€T n l' n 2 QeT 

where T" 1 '" 2 := fl T^ 2 . By using the above tree estimate (64) one can majorize (65) 
by 



E 1 E 2 J2^ ni ^ ri2 E IH (66) 

m,n 2 TeT n i' n 2 

where, according to the same Corollaries the summation goes over those ni,n 2 G Z such 
that 

2 -n 3 < 

for j = 1, 2. On the other hand we also know that we can estimate the inner sum in (66) 
in two different ways, namely 



|/r|<2 2rai , 2" 2 



(67) 
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< 2 2niai 2 n2a2 (68) 

for any < a±, a 2 < 1 with ai + a 2 = 1. Using (68), we can estimate (66) further by 

E 1 E 2 £ 2~ ni ( 1 ~ 2ai ^2~ n2 ( 1 ~ a '^ 

ni,ra 2 

< E 1 E2{—) 1 ~ 2ai {—) 1 ~ cn = S\- 2ai Sl~ a2 El ai E^, 
Ei E 2 

as long as 1 — 2ai > 0, 1 — a 2 > and a\ + a 2 = 1. Now, if we choose ai,a 2 such 
that #i = l — 2«i and 9 2 = 1 — a 2 we observe that 6>i + 29 2 = 1 and the last term above 
becomes 

b 1 b 2 ti l ti 2 

This proves our inequality in the case when 9i,9 2 £ (0,1)- We now prove the endpoint 
case #i = (and so 9 2 = 1/2). 
From (67) we have 

£ |/r| <min(2 2 "\2" 2 ) 

reT n i-"2 

and as a consequence, our sum in (66) can be estimated by 



E 1 E 2 2- ni 2~ n2 min(2 2ni ,2 ra2 ) 

ni,U2 

= E X E 2 2~™ 2 min ( 2ni > 2" ni 2" 2 ) 

ri2 ni 

< E,E 2 2-" 2 2^ = E X E 2 2"^ 



= E^Et&yi* = S 1 2 /2 E 1 E 1 2 /2 
E 2 

and this ends the proof of the main part of the Proposition. 

In the particular case when all the bi-tiles in Q are disjoint, one just has to observe 
that all the trees T in Q are "one bi-tile trees" and then to use the trivial inequality 



< sizei((a Ql )Q eT )size e ((6Q 2 )Q eT )|/ T | (69) 



instead of the previous (64). 

The proof of the remaining Proposition 14.1 is very similar and will be omitted. The 
only difference is that one has to use Lemma 16.2 instead of the tree estimate (64). The 
extra term "[...]" (which did not appear in the proof of Proposition 10.8 ) is harmless in 
the whole process and can be factored out. This is why it is the same in the statements 
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of Lemma 16.2 and Proposition 14.1. Finally, when one keeps track of the "numerology", 
one ends up with a condition depending on e, instead of the previous 6\ + 29 2 = 1. 
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